VM Fingerprinting
Linking two or more locally compromised Virtual Machines (VMs) to the same pseudonym.
Threat Model[edit]
In computing terms, the threat model concerns potential threats or structural vulnerabilities that can be exploited by adversaries; in the case of Whonix this means possible identifiers leading to full or partial deanonymization.
VM fingerprinting threats arise when two VMs are compromised, for instance:
- VMs infected with malware.
- VMs that have software locally installed which includes anti-features such as privacy-intrusive, tracking elements.
For the context of this wiki page, in short called, tracking software.
In this case, the goal of the adversary is to link two or more VMs to the same pseudonym.
Note:
- This is a general issue. This is not a Whonix specific problem. All operating systems running inside virtualizers are affected.
- This is not a VM / virtualizer specific problem. All VMs / virtualizers are affected.
- When running an operating system on hardware without any VMs, i.e. if tracking software runs on hardware, the problem is even worse than when using VMs. That is because in that case, there is no compartmentalization through VMs and tracking software can access hardware serial numbers.
Related: Tor Browser and Whonix goals and non-goals
Definitions[edit]
Local Non-Deterministic Artifacts[edit]
Automatically Generated Local Non-Deterministic Artifacts[edit]
A number of non-deterministic artifacts are present in the various Whonix variants - consider the examples below:
- In Qubes-Whonix™, the contents of many log files like
/var/log/apt/history.log
usually contain text such asStart-Date: 2020-04-11 07:20:20
. This is visible in both the Qubes Template and in AppVMs/DispVMs based on that Template due to the usual Qubes root file system sharing. [1] - The file creation timestamps of numerous files will be unique in a Template, but also shared with multiple Qubes TemplateBased AppVMs/DispVMs that are based on that Template. [2]
- Hardware details can potentially be used for identification, including
/proc/cpuinfo
,/proc/bus
,/proc/scsi
and/sys
. For this reason, Whonix includes an opt-in (currently being tested)hide-hardware-info.service
systemd unit to limit this information to the root user only; see Restrict Hardware Information to Root. - The Protocol Leak Protection and Fingerprinting Protection chapter outlines a host of other possible fingerprinting identifiers.
- These variables are related to the reproducible builds movement which is working on preventing non-deterministic artifacts in packages and ultimately also iso, template, raw, and other images. It is also related to most operating systems not being stateless.
Please note this is not an exhaustive list.
User-generated Local Non-Deterministic Artifacts[edit]
Non-deterministic artifacts are also caused by any user modifications inside the VM, such as editing ~/.bashrc
or changing the default editor.
Attacks[edit]
Non-Deterministic Artifacts[edit]
As noted in the Threat Model introduction, non-deterministic local artifacts matter in:
- A) VMs infected with malware. And,
- B) VMs that have software locally installed which includes anti-features such as privacy-intrusive, tracking features.
For the context of this wiki page, in short both being referred to as, tracking software.
In that case, the impact on Non-Qubes-Whonix™ and Qubes-Whonix users is noted below.
Non-Qubes-Whonix[edit]
All VMs are "StandaloneVMs". [3] Automatically generated, non-deterministic, local artifacts are "somewhat" unique in every VM. At a minimum, an adversary could use these local artifacts to determine the build version of Whonix because these artifacts are shared among all[4] Whonix images of that specific version.
Further identifiers then depend on individual behavior as to what kind of user-generated, local, non-deterministic artifacts exist. For example, if users change the keyboard layout to German, set their shell to python, and install an adblocker in every Tor Browser instance inside of a VM, this will very likely create enough uniqueness to permit an adversary that has compromised multiple VMs to link all of these to the same pseudonym.
Qubes-Whonix[edit]
- TemplateBased AppVMs / DispVMs: There are a lot of automatically generated, non-deterministic, local artifacts in the root file system which will be shared by all TemplateBased AppVMs based on that Template, as well as in the root file system of every DispVM which is spawned from the Template upon which it is based. These local artifacts are unique to each Template but shared among all AppVMs / DispVMs that are based on that Template. Therefore tracking software (an adversary with access -- which can be via malware or locally installed applications which include anti-features) -- that compromised two VMs based on that Template can link these VMs to the same pseudonym.
- PVH StandaloneVMs: The impact depends on when the StandaloneVM was cloned from a Template.
- Right after creation download: Theoretically, this case could be better and might be similar to Non-Qubes-Whonix. However, in practice local artifacts are still created since Qubes automatically boots VMs right after download due to Qubes' qvm-features-request mechanism.
- Later cloned after starting the Template: A lot of automatically generated, non-deterministic, local artifacts will already exist.
- HVM StandaloneVMs: Probably very few users are utilizing these. Even if they are in use, these would probably not be based on
whonix-ws
, which would be even worse. If awhonix-ws
based HVM StandaloneVM is utilized, this might be a similar situation to Non-Qubes-Whonix.
Benchmarking[edit]
An adversary could benchmark the CPU, GPU, HDD, RAM, other hardware (such as the microphone, keyboard, and camera) and/or network connection to create a unique profile which is similar enough to link two compromised VMs to the same pseudonym.
Defenses[edit]
General[edit]
- Avoid non-freedom software since these are more likely (and often) contain privacy-intrusive, tracking anti-features.
- Related comment: Pitfalls of proprietary software
- Related comment: Difficulties in protecting against proprietary software
- Avoid exploitation and work on improving general security on the Whonix platform.
- In theory, System Identity Camouflage .
Non-Deterministic Artifacts[edit]
- A future goal is to consider the possibility of "clean" templates / images. "Clean" in this case refers to a template or image which is free of any non-deterministic artifacts. At a minimum this probably requires reproducible builds as a prerequisite. In addition, the operating system inside the VM should be stateless so that all users who do not customize their base VM (install any packages) look similar.
- security-misc has a feature currently in testing to Restrict Hardware Information to Root.
- Package security-misc from Whonix to Qubes
- Kicksecure™ in Qubes
- Strong Linux User Account Isolation
- Planned security features for Kicksecure such as untrusted root user.
- Unifying file creation timestamps during build might be beneficial. This will occur as a byproduct of eventual reproducible build efforts.
- Unifying file creation timestamps at boot might be beneficial.
- Qubes specific:
- During startup of TemplateBased AppVMs and DispVMs, logs that were generated in the Template could be purged. (During that boot phase, no tracking software can be active unless the Template was compromised.)
Benchmarking[edit]
Where possible, limits could be enforced for the CPU, GPU (theoretically), HDD, RAM and network connection.
Related ticket: virtualizer: enforce maximum system resources a virtual machine may use.
Such a feature could potentially be implemented in sandbox-app-launcher (development discussion). This would be useful in the case of buggy / misbehaving applications not accidentally DDOS'ing the host as well as compromised applications trying to benchmark the VM.
Forum Discussion[edit]
- VM Fingerprinting - linking two or more locally compromised VMs to the same pseudonym
- https://forums.whonix.org/t/qubes-identifiers/17994/2
- https://github.com/QubesOS/qubes-issues/issues/8833#issuecomment-1879967160
- https://forums.whonix.org/t/anonymize-etc-machine-id/7721
See Also[edit]
- System Identity Camouflage
- CPUID
- Distro Morphing vs Builds
- Fingerprint: Whonix vs other anonymity systems or tools
- Network, Browser and Website Fingerprint
- Protocol Leak Protection and Fingerprinting Protection
Footnotes[edit]
- ↑
Qubes debian-12 Template
cat /var/log/apt/history.log
Start-Date: 2020-04-11 07:20:20
- ↑
ls -la /etc/apt/trusted.gpg
Qubes whonix-ws based DispVMs:
-rw-r--r-- 1 root root 1138 Mar 7 09:39 /etc/apt/trusted.gpg
Qubes debian-12 based VM:
-rw-r--r-- 1 root root 1138 Mar 26 2018 /etc/apt/trusted.gpg
- ↑ Qubes:
In general terms, a VM is described as standalone if and only if it does not depend on any other VM for its root filesystem. (In other words, a VM is standalone if and only if it is not a TemplateBasedVM.)
- ↑ Or most.
We believe security software like Whonix needs to remain open source and independent. Would you help sustain and grow the project? Learn more about our 12 year success story and maybe DONATE!